Update LightRAG setup: Kimi 2.5 entity extraction & Cerebras+Qwen3 options by OnlyTerp · Pull Request #1 · OnlyTerp/hermes-optimization-guide

OnlyTerp · 2026-04-16T04:37:46Z

Summary

Replaces the single generic .env example in the LightRAG setup section with three clearly labeled configuration options:

Option A — Kimi 2.5 + Fireworks (recommended for quality)
Option B — Cerebras + Qwen 3 (recommended for speed)
Option C — Free local Ollama setup

Also adds LLM_BINDING_HOST for custom API endpoints, an "API keys" quick-reference with signup links, the entity extraction model comparison table (was missing from part3-lightrag-setup.md), and stronger recommendation copy. Updated both README.md and part3-lightrag-setup.md to stay in sync.

Updates since last revision

Rebased on main to resolve merge conflicts with the security-hardening commit (placeholder API keys now use <your-xxx-api-key> format throughout)
Preserved the chmod 600 security tip from main in README.md

Review & Testing Checklist for Human

Verify LLM_BINDING_HOST is a real LightRAG env var. This was not present in the original examples. If LightRAG uses a different name (e.g. LLM_BINDING_BASE_URL, LLM_BASE_URL), the Kimi and Cerebras examples will silently fail to connect. This is the highest-risk item.
Verify API endpoint URLs — https://api.moonshot.cn/v1 (Kimi) and https://api.cerebras.ai/v1 (Cerebras) should be confirmed against current provider docs.
Verify model name strings — kimi-2.5, qwen-3-32b, qwen3:32b (Ollama variant) — confirm these match what each provider actually expects.
Spot-check that both files are consistent — the entity extraction table, .env options, and recommendation text should be identical between README.md (Part 3 inline section) and part3-lightrag-setup.md. Note: part3-lightrag-setup.md is currently missing the chmod 600 security tip that README.md has — minor inconsistency.

Notes

The README.md already had partial Kimi 2.5 references and the model comparison table; the main changes there are restructuring the .env into labeled options, adding LLM_BINDING_HOST, and adding the API key links.
part3-lightrag-setup.md had larger changes — it was still referencing gpt-4.1-mini as the default and lacked the model comparison table entirely.
The troubleshooting "Slow ingestion" section in part3-lightrag-setup.md was also updated to recommend Cerebras + Qwen 3 / Kimi 2.5 instead of the old GPT-4.1-mini / Claude Haiku suggestion.
No code changes — documentation only.

Link to Devin session: https://app.devin.ai/sessions/be57baa807b64c75b7f3cfe3596230d8
Requested by: @OnlyTerp

devin-ai-integration · 2026-04-16T04:37:49Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

…lti-option .env - Replace single .env example with three clear options: Option A: Kimi 2.5 + Fireworks (recommended) Option B: Cerebras + Qwen 3 (fastest ingestion) Option C: Free local Ollama setup - Add LLM_BINDING_HOST config for Kimi and Cerebras endpoints - Add API key signup links for Moonshot, Cerebras, and Fireworks - Update prerequisites to recommend Kimi 2.5 and Cerebras+Qwen3 - Add entity extraction model comparison table to part3-lightrag-setup.md - Strengthen recommendation text with specific use-case guidance - Update troubleshooting slow ingestion to reference new models - Keep both README.md and part3-lightrag-setup.md in sync Co-Authored-By: Rob <onerobby@gmail.com>

devin-ai-integration

Devin Review found 1 potential issue.

View 2 additional findings in Devin Review.

devin-ai-integration · 2026-04-16T19:48:06Z

+> **Where to get API keys:**
+> - **Kimi / Moonshot:** [platform.moonshot.cn](https://platform.moonshot.cn) — sign up, create an API key
+> - **Cerebras:** [cloud.cerebras.ai](https://cloud.cerebras.ai) — free tier available, very generous limits
+> - **Fireworks:** [fireworks.ai](https://fireworks.ai) — sign up for an API key


🟡 Security tip for chmod 600 on .env file dropped from part3-lightrag-setup.md

The old part3-lightrag-setup.md (line 87) included an important security tip: > **Security tip:** Set restrictive permissions on this file: \chmod 600 ~/.hermes/lightrag/.env`. This was removed during the rewrite but was kept in README.md:674. Since the .env` file contains API keys, users reading only the standalone part3 guide will miss this security guidance, potentially leaving their API keys world-readable.

Suggested change

> **Where to get API keys:**

> - **Kimi / Moonshot:** [platform.moonshot.cn](https://platform.moonshot.cn) — sign up, create an API key

> - **Cerebras:** [cloud.cerebras.ai](https://cloud.cerebras.ai) — free tier available, very generous limits

> - **Fireworks:** [fireworks.ai](https://fireworks.ai) — sign up for an API key

> **Security tip:** Set restrictive permissions on this file: `chmod 600 ~/.hermes/lightrag/.env`

> **Where to get API keys:**

> - **Kimi / Moonshot:** [platform.moonshot.cn](https://platform.moonshot.cn) — sign up, create an API key

> - **Cerebras:** [cloud.cerebras.ai](https://cloud.cerebras.ai) — free tier available, very generous limits

> - **Fireworks:** [fireworks.ai](https://fireworks.ai) — sign up for an API key

Was this helpful? React with 👍 or 👎 to provide feedback.

Replaces #1 with current LightRAG examples: Kimi K2.6 via https://api.moonshot.ai/v1 for quality, Cerebras gpt-oss-120b for speed instead of deprecated qwen-3-32b, Fireworks Qwen3-Embedding-8B for high-quality embeddings, and local Ollama for free/private setup. Keeps README.md and part3-lightrag-setup.md in sync and preserves real LightRAG LLM_BINDING_HOST / EMBEDDING_BINDING_HOST env vars. Supersedes #1.

OnlyTerp · 2026-05-27T16:37:36Z

Handled this as a corrected replacement in #17. The structure from this PR (multiple LightRAG setup options) was good, but the concrete examples had gone stale:

Kimi should use the current slug and the international endpoint, not / .
Cerebras is deprecated; docs: refresh LightRAG model setup examples #17 uses current production instead.
Kept real LightRAG env vars (, ) and synced both + .

Closing this in favor of #17.

OnlyTerp · 2026-05-27T16:37:38Z

Superseded by #17, which lands the same LightRAG multi-option setup idea with current Kimi/Cerebras model IDs and endpoints.

devin-ai-integration · 2026-05-27T16:37:40Z

Devin is archived and cannot be woken up. Please unarchive Devin if you want to continue using it.

OnlyTerp · 2026-05-27T16:38:12Z

Clean note after close: #17 superseded this PR with the same LightRAG multi-option setup idea, but corrected the stale concrete examples.

Kimi uses kimi-k2.6 and the international https://api.moonshot.ai/v1 endpoint.
Cerebras uses current production gpt-oss-120b instead of deprecated qwen-3-32b.
The docs keep real LightRAG env vars: LLM_BINDING_HOST and EMBEDDING_BINDING_HOST.
Both part3-lightrag-setup.md and README.md are synced in docs: refresh LightRAG model setup examples #17.

devin-ai-integration Bot assigned OnlyTerp Apr 16, 2026

devin-ai-integration Bot force-pushed the devin/1776314159-lightrag-kimi-cerebras-update branch from 35e9a9f to e0244cb Compare April 16, 2026 19:46

devin-ai-integration Bot reviewed Apr 16, 2026

View reviewed changes

OnlyTerp mentioned this pull request May 27, 2026

docs: refresh LightRAG model setup examples #17

Merged

OnlyTerp closed this May 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update LightRAG setup: Kimi 2.5 entity extraction & Cerebras+Qwen3 options#1

Update LightRAG setup: Kimi 2.5 entity extraction & Cerebras+Qwen3 options#1
OnlyTerp wants to merge 1 commit into
mainfrom
devin/1776314159-lightrag-kimi-cerebras-update

OnlyTerp commented Apr 16, 2026 •

edited by devin-ai-integration Bot

Loading

Uh oh!

devin-ai-integration Bot commented Apr 16, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Apr 16, 2026

Uh oh!

OnlyTerp commented May 27, 2026

Uh oh!

OnlyTerp commented May 27, 2026

Uh oh!

devin-ai-integration Bot commented May 27, 2026

Uh oh!

OnlyTerp commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

OnlyTerp commented Apr 16, 2026 • edited by devin-ai-integration Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Updates since last revision

Review & Testing Checklist for Human

Notes

Uh oh!

devin-ai-integration Bot commented Apr 16, 2026

🤖 Devin AI Engineer

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

OnlyTerp commented May 27, 2026

Uh oh!

OnlyTerp commented May 27, 2026

Uh oh!

devin-ai-integration Bot commented May 27, 2026

Uh oh!

OnlyTerp commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

OnlyTerp commented Apr 16, 2026 •

edited by devin-ai-integration Bot

Loading